Statistical model of intra-chromosome contact maps
نویسندگان
چکیده
The statistical properties of intra-chromosome maps obtained by a genome-wide chromosome conformation capture method (Hi-C) are described in the framework of the hierarchical crumpling model of heteropolymer chain with quenched disorder in the primary sequence. We conjecture that the observed Hi-C maps are statistical averages over many different ways of hierarchical genome folding, and show that the existence of quenched primary structure coupled with hierarchical folding can induce the observed fine structure of intra-chromosome contact maps. The analysis of DNA folding in human genome based on the genome-wide chromosome conformation capture method [1, 2] provides a comprehensive information on spatial contacts between genomic parts and imposes essential restrictions on available 3D genome structures. The Hi-C maps obtained in the experiments on various organisms and tissues [2–8] (some examples are shown in the Fig.1A-C) define the average contact probability, P(s) between two particular parts of genome separated by the (genomic) distance s along a chain. These maps display very rich structure in a broad interval of scales, where P(s) ∼ 1/s (see the Fig.1D) in consistency with the so-called fractal (or " crumpled ") globule [9] model of polymer chain packing. The latter is characterized by two main features: (i) it has almost no knots, and (ii) it is self-similar in a wide range of scales, forming a fractal space-filling structure. Both these properties are essential for genome folding: fractal organization makes genome tightly packed at all scales, while the lack of knots ensures easy and independent opening and closing of genomic domains, necessary for transcription [10, 11]. In 3D this packing results in a space-filling conformation of a chain with fractal dimension D f = D = 3. The Hi-C contact probability, P i,j , between two ge-nomic units i and j depends in a complicated way on a combination of structural and energetic factors. However , in a fractal globule with D f = 3 the average contact probability, P(s) = (N − s) −1 N −s i=0 P i,i+s , between two units separated by the genomic distance s = |i − j| should decay as P(s) ∼ s −1. This probability can be naively estimated as P(s) ∼ 1/R D (s), where R(s) is the typical size occupied by the s–fragment. For curves with the fractal dimension D f , one has R(s) ∼ s 1/D f , hence, P(s) ∼ s −D/D f , and for …
منابع مشابه
Transferable model for chromosome architecture.
In vivo, the human genome folds into a characteristic ensemble of 3D structures. The mechanism driving the folding process remains unknown. We report a theoretical model for chromatin (Minimal Chromatin Model) that explains the folding of interphase chromosomes and generates chromosome conformations consistent with experimental data. The energy landscape of the model was derived by using the ma...
متن کاملPhysical origin of the contact frequency in chromosome conformation capture data.
Physical proximity between each pair of genomic loci in a nucleus is measured as a form of contact frequency in chromosome conformation capture-based methods. Complexity of chromosome structure in interphase can be characterized by measuring a statistical property of physical distance between genomic loci according to genomic separation along single chromatids. To find a relationship between th...
متن کاملTitle: A Transferable Model For Chromosome Architecture
In vivo, the human genome folds into a characteristic ensemble of three-dimensional structures. The mechanism driving the folding process remains unknown. We report a theoretical model for chromatin (Minimal Chromatin Model) that explains the folding of interphase chromosomes and generates chromosome conformations consistent with experimental data. The energy landscape of the model was derived ...
متن کاملPerformance evaluation of EPM and MPSIAC Models for determination of Erosion Status of Shahriari Watershed
Soil erosion is one of the most important environmental issues in developing countries, including Iran that there is inaccurate information about its amount and distribution. For this purpose, the accuracy and distribution of erosion classes obtained from EPM and MPSIAC models as compared to BLM as ground truth values were evaluated in Shahriari watershed. First, the required data and informati...
متن کاملStatistical Properties of Contact Maps
A contact map is a simple representation of the structure of proteins and other chainlike macromolecules. This representation is quite amenable to numerical studies of folding. We show that the number of contact maps corresponding to the possible configurations of a polypeptide chain of N amino acids, represented by (N21)-step self-avoiding walks on a lattice, grows exponentially with N for all...
متن کامل